Are Your Digital Documents Web Friendly?: Making Scanned Documents Web Accessible
نویسندگان
چکیده
منابع مشابه
Making Indian Language Legacy Documents Accessible Via Web
The reliable optical character recognition is not available for scripts of Indian languages. Thus, the only way to make legacy documents in Indian languages available on the web is by scanning them. This work is an attempt to cater to the need for a better representation and efficient storage technique for Indian language documents and their near perfect regeneration at the browser. We work wit...
متن کاملTowards Scalable Web Documents
Article summary. The current Web is running into serious scalability problems. The standard solution is to apply techniques like caching, replication, and distribution. Unfortunately, as the variety of Web applications continues to grow, it will be impossible to find a single solution that fits all needs. The authors advocate a different approach to tackling scaling problems. Instead of seeking...
متن کاملAdaptive Replicated Web Documents
Caching and replication techniques can improve latency of the Web, while reducing network traffic and balancing load among servers. However, no single strategy is optimal for replicating all documents. Depending on its access pattern, each document should use the policy that suits it best. This paper presents an architecture for adaptive replicated documents. Each adaptive document monitors its...
متن کاملWriting Web Documents about Films
This paper describes our experiences our experience in building and using a Web-based video library designed for educational use. The CAETI Internet Multimedia Library’s initial audience is K-12 schools; most of the content of our library comes from news and politics-related historical footage. The video library is a good tool not just for content but also for acquiring visual literacy. Politic...
متن کاملClustering Template Based Web Documents
More and more documents on theWorld WideWeb are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. Grouping together documents which are based on the same template is an important task for applications that analyse the template structure and need clean training data. This paper develops and compares several distance m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information Technology and Libraries
سال: 2010
ISSN: 2163-5226,0730-9295
DOI: 10.6017/ital.v29i3.3140